Toward detecting voice activity employing soft decision in second-order conditional MAP
نویسندگان
چکیده
In this paper, we propose a novel approach to statistical modelbased voice activity detection (VAD) that incorporates a secondorder conditional maximum a posteriori (MAP) criterion. As a technical improvement for the first-order conditional MAP criterion in [1], we consider both the current observation and the voice activity decision in the previous two frames to take full consideration of the inter-frame correlation of voice activity. The soft decision scheme is incorporated to result in time-varying thresholds for further performance improvement. Experimental results show that the proposed algorithm outperforms the conventional CMAP-based VAD technique under various experimental conditions.
منابع مشابه
Statistical Model-Based Voice Activity Detection Based on Second-Order Conditional MAP with Soft Decision
© 2012 ETRI Journal, Volume 34, Number 2, April 2012 In this paper, we propose a novel approach to statistical model-based voice activity detection (VAD) that incorporates a second-order conditional maximum a posteriori (CMAP) criterion. As a technical improvement for the first-order CMAP criterion in [1], we consider both the current observation and the voice activity decision in the previous ...
متن کاملImproved Global Soft Decision Incorporating Second-Order Conditional MAP in Speech Enhancement
In this paper, we propose a novel method based on the second-order conditional maximum a posteriori (CMAP) to improve the performance of the global soft decision in speech enhancement. The conventional global soft decision scheme is found through investigation to have a disadvantage in that the global speech absence probability (GSAP) in that scheme is adjusted by a fixed parameter, which could...
متن کاملA voice activity detector employing soft decision based noise spectrum adaptation
In this paper, a voice activity detector (VAD) for variable rate speech coding is decomposed into two parts, a decision rule and a background noise statistic estimator, which are analysed separately by applying a statistical model. A robust decision rule is derived from the generalized likelihood ratio test by assuming that the noise statistics are known a priori. To estimate the time-varying n...
متن کاملVoice activity detection based on conditional MAP criterion incorporating the spectral gradient
In this paper, we propose a novel approach to improve a statistical model-based voice activity detection (VAD) method based on a modified conditional maximum a posteriori (MAP) criterion incorporating the spectral gradient scheme. The proposed conditional MAP incorporates not only the voice activity decision in the previous frame as in [1] but also the spectral gradient of the observed spectra ...
متن کاملVoice activity detection using global soft decision with mixture of Gaussian model
An improvement on the voice detection algorithm using global soft decision (GSD) is made in this paper. In GSD method, the speech and noise are modelled by the presumed probability density function, e.g. Gaussian pdf. We propose that the estimation and modelling of the signal is done in the domain of filterbank output which widely used in most speech processing applications. Since the output of...
متن کامل